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-X 

doc-id-l 
start-byte 
1  en 
-x 

doc-id-2 
s tart -byte 
1  en 

Notice  that  each  argument  must  be  placed  on  a  separate  line.  Thus,  in  a 
file-based  system,  delete-args-file  might  look  like  this: 

-X 

/usr /mart in/mail -archive 

4041968 

24601 

-x 

/usr /mart  in/mail-archive 
4121861 

94309 

Here  we  see  Martin  has  kept  mail  archives  with  multiple  documents  in 
each  file  and  provided  the  offsets  and  lengths  of  the  ones  to  be  ignored 
by  the  WAIS  server. 

Merging  and  Replacing  WAIS  Databases 

Two  other  waisindex  options  that  help  manage  changing  databases  are 
-merge  and  -replacewith. 

The  -merge  option  joins  two  sets  of  index  files.    It  takes  two 
arguments,  the  path  names  of  the  index  directories  to  be  combined.  For 
example, 

%  waisindex   -d   final-db   -merge  dbl  db2  -finalize 

will  merge  the  wais/ indexes /dbl  and  wais  /  indexes  /db2  index  files  into 
wais/ indexes /final-db.    Notice  that  the  destination  index  directory  is 
specified  as  an  argument  to  the  -d  switch.    In  addition,  the  -finalize 
switch  is  required  when  using  -merge;  it  makes  the  resulting,  merged 
index  searchable. 
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Multiple  display  formats  are  equally  useful  in  cases  where  documents 
exist  in  multiple  formats.    A  technical  report,  for  example,  may  be 
available  in  text,  Postscript,  and  Microsoft  Word  format  and  you  may 
want  to  make  all  three  formats  available  to  the  user.    This  can  be  done 
with  a  command  such  as: 

%  waisparse   -assoc  ps   PS   -assoc   doc  MS-WORD 
/wais/data/tech-reps/* . txt  | 
wais index  -d  /wais/ indexes /resume 

Here,  the  first  -assoc  switch  indicates  that  Postscript  files  use  the  file 
extension  .ps  and  the  PS  display  format.    The  second  -assoc  switch 
identifies  Microsoft  Word  files  as  those  with  .  doc  extensions  and  using 
the  MS -WORD  display  format.    The  files  that  are  parsed  and  indexed  are 
specified  as  /wais/data/tech-reps/*  .  txt  using  the  default  text  parse 
format.    The  Postscript  and  Microsoft  Word  format  files, 
/wais/data/tech-reps/*  .ps  and  /wais/data/tech-reps/*  .doc, 
respectively,  are  associated  with  these  text  files. 

The  .  src  File  and  the  Directory  of  Servers 

A  directory  of  servers  is  a  WAIS  database  made  up  of  .src  files.  Each 

database .  src  file  contains  information  about  a  particular  database. 

Once  you  have  built  the  index  for  your  database,  there  will  be  a 

database  .src  file  in  your  index  directory.    Using  your  favorite  editor, 

open  it  and  take  a  look.   A  sample  database .  src   file  is  shown  below. 

(  :  source 

: version  3 

: ip- address  "129.362.28.5" 
: ip-name    " server -name : where ever . com" 
: tcp-port  210 

: database-name    " /wai  s / indexes /mydat a " 
: maintainor  "whoever@whereever.com" 
: description 

"This   is  where  you  put  a  textual  description  of  the 

database  you  are  defining.    Be   sure   to   include  a  description  of 

any  tagged  fields  you  may  have  defined  for  the  database .  This 

the   information  users  will   turn  to  decide  whether  or  not  to 

search  your   database. " 

) 

The  values  in  each  field  will  be  different  from  those  shown  above. 
Check  them  and  correct  any  incorrect  values.    The  waisindex  program 
does  its  best  to  fill  in  the  correct  information,  but  you  probably  need  to 
change  something.    In  particular,  be  sure  to  check  the  ip-address,  ip- 
name,  and  tcp-port  values.   Also  check  the  email  address  for  the  official 
maintainer.    Next,  add  a  textual  description  of  the  database.    It  should 


